Asynchronous-transition Hmm for Acoustic Modeling

نویسندگان

  • Shigeki Sagayama
  • Shigeki Matsuda
  • Mitsuru Nakai
  • Hiroshi Shimodaira
چکیده

We propose a new class of hidden Markov model (HMM) which we call Asynchronous-Transition HMM (AT-HMM) to model asynchronous temporal structure of acoustic feature sequences. Conventional HMM models a sequence of feature vectors, while temporally changing patterns of acoustic features do not necessarily synchronize with each other. In this paper, AT-HMMs with and without sequential constraints are discussed. Algorithms for generating context-dependent AT-HMM and for deriving sequentially constrained AT-HMM are provided. A new concept of “state tying across time” is also introduced. Speaker-dependent speech recognition experiments demonstrated error reduction rates of more than 30% and 40% in phoneme and isolated word recognition, respectively, compared with conventional HMMs.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Asynchronous-transition HMM

We propose a new class of hidden Markov model (HMM) called asynchronous-transition HMM (AT-HMM). Opposed to conventional HMMs where hidden state transition occurs simultaneously to all features, the new class of HMM allows state transitions asynchronized between individual features to better model asynchronous timings of acoustic feature changes. In this paper, we focus on a particular class of...

متن کامل

Maximum entropy direct model as a unified model for acoustic modeling in speech recognition

Traditional statistical models for speech recognition have been dominated by generative models such as Hidden Markov Models (HMMs). We recently proposed a new framework for speech recognition using maximum entropy direct modeling, where the probability of a state or word sequence given an observation sequence is computed directly from the model. In contrast to HMMs, features can be non-independ...

متن کامل

Bernoulli versus Markov: Investigation of state transition regime in switching-state acoustic models

In this paper, a new acoustic model called time-inhomogeneous hidden Bernoulli model (TI-HBM) is introduced as an alternative to hidden Markov model (HMM) in continuous speech recognition. Contrary to HMM, the state transition process in TI-HBM is not a Markov process, rather it is an independent (generalized Bernoulli) process. This difference leads to elimination of dynamic programming at sta...

متن کامل

Advanced Acoustic Modeling with the Hybrid HMM/BN Framework

Most of the current state-of-the-art speech recognition systems are based on HMMs which usually use mixture of Gaussian functions as state probability distribution model. It is a common practice to use EM algorithm for Gaussian mixture parameter learning. In this case, the learning is done in a ”blind”, data-driven way without taking into account how the speech signal has been produced and whic...

متن کامل

Implicit Trajectory Modeling through Gaussian Transition Models for Speech Recognition

It is well known that frame independence assumption is a fundamental limitation of current HMM based speech recognition systems. By treating each speech frame independently, HMMs fail to capture trajectory information in the acoustic signal. This paper introduces Gaussian Transition Models (GTM) to model trajectories implicitly. Comparing to alternative approaches, such as segment modeling and ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999